#World Model
Research
PAN: A World Model for General, Interactable, and Long-Horizon World Simulation
Jiannan Xiang, Yi Gu, Zihan Liu, Zeyu Feng, Qiyue Gao, Yiyan Hu, Benhao Huang, Guangyi Liu, Yichi Yang, Kun Zhou, Davit Abrahamyan, Arif Ahmad, Ganesh Bannur, Junrong Chen, Kimi Chen, Mingkai Deng, Ruobing Han, Xinqi Huang, Haoqiang Kang, Zheqi Li, Enze Ma, Hector Ren, Yashowardhan Shinde, Rohan Shingre, Ramsundar Tanikella, Kaiming Tao, Dequan Yang, Xinle Yu, Cong Zeng, Binglin Zhou, Hector Liu, Zhiting Hu, Eric P. Xing
Nov 13th 2025
PAN brings imagination to life — fusing language, action, and vision to simulate the world's evolution with stunning realism and consistency.

Do Vision-Language Models Have Internal World Models? Towards an Atomic Evaluation
Qiyue Gao, Xinyu Pi, Kevin Liu, Junrong Chen, Ruolan Yang, Xinqi Huang, Xinyu Fang, Lu Sun, Gautham Kishore, Bo Ai, Stone Tao, Mengyang Liu, Jiaxi Yang, Chao-Jung Lai, Chuanyang Jin, Jiannan Xiang, Benhao Huang, Zeming Chen, David Danks, Hao Su, Tianming Shu, Ziqiao Ma, Lianhui Qin, Zhiting Hu
This paper evaluates whether modern Vision-Language Models (VLMs) such as GPT-4o and Gemini can act as internal world models (WMs), that is, systems that understand and predict the world.


